3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
It will be available
License:
Size:
2 GByte Production Status:
Newly created-in progress
Use:
Word Sense Disambiguation
-
Paper title:Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yadollah Yaghoobzadeh | WIKI-PSE | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Multilingual
Languages:
Dutch English German
Availability:
From Data Center(s)
License:
CELEX Agreement
Size:
None Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Using LSTMs to Assess the Obligatoriness of Phonological Distinctive Features for Phonotactic Learning
-
Paper track:Long/Phonology, Morphology and Word Segmentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nicole Mirea | CELEX2 | /N |
Documentation:
Yes, in English, publicly available
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
CC-BY-SA-4.0
Size:
1200 instances OtherProduction Status:
Newly created-finished
Use:
Emphasis Selection for Written Text
-
Paper title:Learning Emphasis Selection for Written Text in Visual Media from Crowd-Sourced Label Distributions
-
Paper track:Short/Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Amirreza Shirani | Emphasis Selection Spark dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Ancient Greek Arabic Chinese English Finnish Hebrew Korean Russian Swedish
Availability:
Freely Available
License:
CreativeCommons, Gnu
Size:
11814230 tokens Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:The (Non-)Utility of Structural Features in BiLSTM-based Dependency Parsers
-
Paper track:Long/Tagging, Chunking, Syntax and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Agnieszka Falenska | Universal Dependencies 2.0 | /N |
Documentation:
https://universaldependencies.org/v2/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC User Agreement for Non-Members
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:The (Non-)Utility of Structural Features in BiLSTM-based Dependency Parsers
-
Paper track:Long/Tagging, Chunking, Syntax and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Agnieszka Falenska | Penn Treebank | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC99T42/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
4030 sentences Production Status:
Newly created-finished
Use:
Humour analysis
-
Paper title:Predicting Humorousness and Metaphor Novelty with Gaussian Process Preference Learning
-
Paper track:Long/Sentence-level semantics
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Edwin Simpson | Humour ranking dataset | /N |
Documentation:
Documentation is to be provided at the linked repository.
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
-
Paper track:Long/Textual Inference and Other Areas of Semantics
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | R. Thomas McCoy | HANS dataset | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
34 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Symbolic Inductive Bias for Visually Grounded Learning of Spoken Language
-
Paper track:Long/Vision, Robotics, Multimodal, Grounding and Speec
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Grzegorz Chrupała | Flickr8k Audio Caption Corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
400000 sentences Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Reversing Gradients in Adversarial Domain Adaptation for Question Deduplication and Textual Entailment Tasks
-
Paper track:Short/Machine Learning
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sparsh Gupta | Quora Question Pairs | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
34013 sentences Production Status:
Newly created-finished
Use:
Argument Component Detection and Classification
-
Paper title:Yes, we can! Mining Arguments in 50 Years of US Presidential Campaign Debates
-
Paper track:Short/Sentiment Analysis and Argument Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Serena Villata | USElecDeb60To16 dataset | /N |
Documentation:
None




